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This paper considers the relative frequencies of distinct types 
of individuals in multitype branching processes. We prove that the 
frequencies are asymptotically multivariate normal when the initial 
number of ancestors is large and the time of observation is fixed. The 
result is valid for any branching process with a finite number of types; 
the only assumption required is that of independent individual evolu- 
tions. The problem under consideration is motivated by applications 
in the area of cell biology. Specifically, the reported limiting results 
are of advantage in cell kinetics studies where the relative frequencies 
but not the absolute cell counts are accessible to measurement. Rele- 
vant statistical applications are discussed in the context of asymptotic 
maximum likelihood inference for multitype branching processes. 

1. Introduction. The paper deals with multitype branching processes 
with discrete or continuous time assuming only the usual independence of 
the individual evolutions. Such processes are considered from a new perspec- 
tive: modeling of the relative frequencies (proportions, fractions) of different 
types of individuals (instead of their absolute numbers) as functions of time. 
Notwithstanding other possible applications, we will use the term "cell" in- 
stead of referring to an abstract individual or particle throughout this paper 
to emphasize its special relevance to cell biology. 

It is well known (see, e.g., Athreya and Ney [1] and Mode [19]) that in 
the positively regular and supercritical case the multitype population vector 
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Z(t) is asymptotically proportional (on the nonextinction set) to the left- 
eigenvector V = {vi,V2, ■ ■ ■ ,Vd) corresponding to the Perron-Frobenius root 
of the mean matrix. Then the frequency of the A;th type converges a.s. to 
Vk/ J2j=iVj (see Mode [19], Theorem 8.3, Chapter 1). Jagers [15] was prob- 
ably the first to consider relative frequencies in the context of biological 
applications. He studied asymptotic properties (as t ^ oo) of a reducible 
age-dependent branching process with two types of cells and proved conver- 
gence of their relative frequencies to nonrandom limits in mean square and 
almost surely on the nonextinction set. The usefulness of such frequencies in 
cell cycle analysis was further demonstrated by Mode [19, 20] who consid- 
ered a four-type irreducible age-dependent branching process. Some of these 
results are discussed in Section 6. The relative frequencies of distinct cell 
types play an important role in the analysis of biological studies of prolif- 
eration and differentiation of cells. As discussed in Section 2, the need for 
such characteristics of cell kinetics arises in experimental situations where 
absolute cell counts are not accessible to measurement. 

To formulate the problem in more technical terms, consider a multitype 
branching stochastic process Z{t) = {Zi{t) , Z2{t) , . . . , Z(i{t)), where Z^it) de- 
notes the number of cells of type {k = 1,2, . . . ,d) at time t. The time may 
be discrete (t £ Nq = {0, 1,2,.. .}) or continuous (t G R"*" = [0, oo)), and the 
evolution of cells may be arbitrarily complex as long as it can be described 
by a (decomposable or nondecomposable) branching process. The main fo- 
cus in the theory of branching process is on probabilistic characteristics of 
the process Z(t) and their asymptotic behavior when t ^ oo. In practical 
applications, however, the investigator is more interested in such character- 
istics at fixed time points (transient process) that provide typically much 
more information on model parameters than their asymptotic counterparts. 
It is still reasonable and useful to study another asymptotic aspect of the 
problem, namely, the behavior of Z(t) when the initial number of cells is 
large, which situation is of frequent occurrence in cell kinetics studies. This 
aspect becomes especially important when considering relative frequencies 
of cell types rather than the process Z(t) itself. 

We define relative frequencies of cell types as Afe(t) = Zk(t)/U{t),k = 
l,2,...,d, for U{t) > 0, where U{t) = I]fc=i ^fc(*) is the total number of cells 
present in the population at time t. In what follows, we assume that there 
is at least one type (say, type Ti) giving rise to all other types, regardless 
of whether the process is decomposable or nondecomposable. We assume in 
addition that the process begins with Zi (0) = N cells and study asymptotic 
properties of the above-defined fractions Ak{t;N) as A^— > oo. 

The paper is organized as follows. Section 2 provides a biological motiva- 
tion for the mathematical problem under consideration. The basic notions 
and preliminaries are introduced in Section 3. Section 4 includes Theorem 1 
stating multivariate asymptotic normality of the fractions (Ai(t; N),A2{t; N), 
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. . . , Arf(t; A^)) at a fixed time moment t > 0. In Section 5, some statistical is- 
sues having the most direct bearing on biological applications are discussed. 
Finally, some concluding remarks and suggestions are given in Section 6. 

2. Biological background and motivation. The theory of branching stochas- 
tic processes has enjoyed a long history of biological applications (see, e.g., 
Jagers [16], Yakovlev and Yanev [22], Kimmel and Axelrod [17], Haccou, 
Jagers and Vatutin [10], Yanev et al. [27], Yakovlev and Yanev [25]). This 
theory has been proven especially useful in cell kinetics studies, many of 
which call for a model with multiple types of cells. A typical example is 
provided by the process of oligodendrocyte generation in cell culture. Oligo- 
dendrocyte type-2 astrocyte progenitor cells, henceforth referred to as 0-2A 
progenitor cells, are known to be precursors of oligodendrocytes in the de- 
veloping central nervous system. When plated in vitro and stimulated to 
divide by purified cortical astrocytes or by platelet-derived growth factor, 
these cells grow in clones giving rise to oligodendrocytes. An 0-2A progeni- 
tor cell is partially committed to differentiation into an oligodendrocyte but 
it retains the ability to proliferate. Oligodendrocytes are terminally differ- 
entiated (mature) cells and they do not divide under normal conditions. At 
different time points over a period of several days after plating, the composi- 
tion of each clone is examined microscopically to count the numbers of 0-2A 
progenitor cells and oligodendrocytes per clone. A certain number N of cell 
clones, each originating from a single initiator cell, are followed-up with the 
observation process being either longitudinal or serial sacrifice, depending 
on the experimental design. Then branching process modeling is needed to 
estimate all the important but unobservable parameters, such as the mean 
mitotic cycle time, probabilities of cell division and differentiation, the mean 
life-span of oligodedrocytes, and so forth, from the observed counts of 0-2A 
progenitor cells and oligodendrocytes as functions of time. 

The first stochastic model of oligodendrocyte development in cell culture 
was proposed by Yakovlev, Mayer-Proschel and Noble [23]. The model struc- 
ture was defined following a set of assumptions that specified it as a special 
case of the Bellman-Harris branching process with two types of cells similar 
to that studied by Jagers [15]. Further studies [12, 13, 23, 24] proved this 
model to be overly simplistic, suggesting a number of refinements that have 
made it much more difficult to handle analytically and numerically. In par- 
allel, estimation techniques have been developed to fit improved versions of 
the model to various experimental data. Because of complexity of the un- 
derlying model, these techniques have been built on simulation counterparts 
of either maximum likelihood or maximum pseudo- likelihood methods. All 
specific applications of the proposed model have invariably been limited to 
statistical inference from counts of both types of cells as functions of the 
time elapsed after plating. 
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The above-described approach may not be feasible in many other experi- 
mental settings. For example, it is technically impossible to count the total 
number of cells of a given type in the blood or bone marrow in animal exper- 
iments. A similar obstacle may arise when studying suspension cell cultures 
consisting of those cell types for which no specific antibodies are available, 
thereby limiting the utility of flow cytometry in counting cell numbers. How- 
ever, this difficulty can be overcome by analyzing the proportions of different 
types of cells as long as they are morphologically distinguishable. While the 
latter expedient is routinely practiced in experimental and clinical labo- 
ratories, mathematical models of cell population kinetics are traditionally 
formulated in terms of cell counts. This call for making multitype branch- 
ing stochastic models suitable for statistical inference from relative frequen- 
cies, rather than absolute cell counts, motivated the present work. A useful 
asymptotic result reported in this paper is a first step toward pertinent 
inferential procedures. The reported result holds for any branching process 
with a finite number of cell types given that the basic postulates for the evo- 
lution of each type are all met — the multitype Bienayme-Galton-Watson, 
Markov, Bellman-Harris, Sevastyanov and Crump-Mode-Jagers processes 
representing typical examples. 

3. Basic notions and preliminary results. Consider the process Z(t) in- 
troduced in Section 1 and its probability generating function (p.g.f.) given 
by 

(1) F{t; s) = E{s^W I Zi(0) = 1} = E{sf^(*^sf • • • I ^i(O) = 1}, 

where s = (si, S2, . . . , s^) and |sfc| < 1, A; = 1, 2, . . . , d. 

By the branching property and the usual assumption of independent in- 
dividual cell evolutions, one has 

(2) FN{t; s) = E{s2W I Zi(0) = N} = s). 

It is well known that all characteristics of the process can be obtained 
from the p.g.f. (1). In the sequel, the following notation will be used: 

(3) mk{t) = E{Z^{t)\Zi{0) = l} = ^F{t;s)\s=i, k = l,2,...,d, 

OSk 

(4) bij{t) = -^—^F{t;s)\^=i, i,j = 1,2,..., d, 

where 1 = (1, 1, ... , 1). Using (3) and (4), one can obtain the following mo- 
ments: 

(5) al{t)=Yai{Zk{t)} = bkk{t) + mk{t)-ml{t), k = l,2,...,d, 

(6) Ci,{t) = Cov{Zi{t),Zj{t)} = b,j{t) - mi{t)m.j{t), i ^ j = 1,2, . . . ,d. 
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We assume that the covariance matrix C^'^\t) = \\Cij{t)\\ is finite and de- 
note its diagonal elements by Cii{t) = af{t),i = 1,2, . . . ,d. To avoid trivially 
unreasonable cases, we assume that af{t) >0,i = l,2,...,d. Then the corre- 
lation matrix H^^^t) = \\rij{t)\\ is well defined, where rij{t) = Cij{t) / ai{t)aj{t) ■ 
Cor(Zj(t), Zj{t)) and obviously rii{t) = l,i = 1,2, ... ,d. 

Introduce the total number of cells at the moment t as 

d 

(7) Uit) = Y.Zkit), 

k=l 

SO that the relative frequencies (fractions, proportions) of types Afc(t) can 
be defined on the nonextinction set {U{t) > 0} as follows: 

(8) Ak{t) = Zk{t)/U{t), k = l,2,...,d, 
with the obvious condition 

d 

(9) ^Afc(t) = l. 

fe=i 

In what follows, we will also need the following proportions: 

(10) pi{t)=mi{t)/M{t), i = l,2,...,d, 
where 

d 

(11) M{t)=¥.{U{t)} = Y,m,{t). 

i=i 

To emphasize the dependence of the process Z(t) on the initial number of 
initiator cells Zi(0) = N, we will use the notation Z(t; A^) = {Zi{t; N),Z2{t; N), 
. . . ,Z(i{t; N)). By the property of independence of cell evolutions, one has 

N 

(12) Zi{t;N) = Y,Z^^\t), i = l,2,...,d, 

k=l 

where {z[^\t)}^^-^ are i.i.d. copies of the branching process Zi{t), i = 
1,2,. ..,d. 

Our focus is on the asymptotic (as N ^ oo) behavior of the fractions 

(13) A,{t;N)=Zi{t;N)/U{t;N), i = l,2,...,d, 
where 

d N 

(14) Uit; N) = J2 Ut; N) = J2 U^'^ it) > 

i=l fc=l 

and UWit) = EtiZl'\t). 
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Proposition 1. Let mi{t) <oo,i = 1,2,..., d. Then for N^oo 

(15) q{t;N)=FT{U{t;N) =0}^0, 

(16) Ai{t;N)^pi{t) a.s.E{Ai{t;N)}^pi{t), 

(17) Yav{Ai{t;N)}^0. 

Proof. Note that by (l)-(2) and (12)-(14) one has q{t; N) = Fr{U{t; N) 
0} = q^{t) ^ as iV ^ oo, where q{t) = Pr{U{t) = 0} = F{t; 0) is the ex- 
tinction probabihty, = (0, 0, . . . , 0) is a zero- vector, and q{t) < 1 for every 
fixed t. 

From (12)-(14) and the law of large numbers (LLN), one obtains 



(18) Aiit;N) 



, AT 1 /r 1 ^ 



N 



k=l 



k=l 



■Pi{t) 



a.s. N oo. 



Put another way, the fractions Aj(t; N), i = 1, . . . ,d, are strongly consis- 
tent estimators for Pi{t) when considered as functions of the initial number 
of ancestors N . Note that for each i the quantity pi{t) may be interpreted 
as the probability for a randomly chosen cell at time t to be of the type Tj. 

By virtue of the fact that < Ai{t; N) < 1 and the dominated convergence 
theorem (DCT), it follows from (18) that E{Aj(t; A'')} converges to Pi{t) as 

— > oo, implying that Ai{t; N) is an asymptotically unbiased estimator for 
Pi{t),i = 1,2, . . . ,d. Similarly, by (18), one has that Af{t; N) -^pf{t) a.s. and, 
therefore, K{A({t; N)} ^ p^ (t) by the DCT. Hence, the result (17) follows 
immediately from (16). This concludes the proof of Proposition 1. □ 

4. Asymptotic multivariate normality of the relative frequencies. In the 

general case with ^i(O) = N, the following asymptotic results hold. 

Theorem 1. Assume af{t) < oo,i = 1, 2, . . . , d, and define A^^'-^t) = 



where 



(19) 



auit) = ai{t){l-pi{t)). 



forij^j; i,j = 1,2,... ,d. 

Then for the r.v. 

(20) Wi{t;N)=M{t)^[Ai{t;N)-pi{t)], 
the following statements are valid as — > oo; 
(i) For every i = 1,2, . . . ,d, 



1,2,. ..,d, 



(21) 



W,it;N)^Yi{t), 
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where Yi{t) is a normally distributed r.v. with M{Yi{t)} = and 

d 

(22) Sf{t) = \aT{Yi{t)} = rM{t)ak^{t)au{t). 

k,l=l 

(ii) For every k = 2,3, . . . ,d — 1 and every subset (ai,a2, ■ ■ ■ ,ak) with 
nonrecurring elements from the set {1,2, . . . ,d} , 

(23) {W^, {t;N),...,W^,{t;N))^ {Y^, (t),..., Y^^ (t)), 

where (Yq,^ (t), . . . , 1^^. (t)) have a joint normal distribution. 

(iii) The covariance matrix of the vectorY^^\t) = (li(t), l2(i)> • • • > ^fc(O) 
is given by 

(24) D^^\t) = II CoY{Y,{t),Y,{t)}\\ = [Ad^k{t)fR^''\t)Ad^k{t), 

where Adxkif) = \\0'ij{t)\\,i = 1,2, ... ,d;j = 1,2, ... ,k, is a [dx k]-submatrix 
ofA('^\t) and [Ad^k{t)r = \\aji{t)\\,j = 1,2, ... ,k;i = 1,2, ... ,d, is the cor- 
responding transposed matrix of [k x d] dimensions. The covariance matrix 
of any subvector (Yq^ (t), . . . , Yq,^ (t)) can be obtained in a form similar to 
(24). 

Proof. From (12)-(14), it follows that for every i = l,2,...,d 

Ai{t;N)-pi{t) 
N 



(25) 

a 



U{t;N) 



a,{t)[l -pi{tmit;N)-pi{t)J2aj{t)V,{t;N) 



where Viit; N) = E^=i[^f ^(0 " mi{t)]/[aiit)VN],i = l,2,...,d. 
Note that E{Vi{t; N)} = 0,yar{Vi{t; N)} = 1 and 

CoT{Vi{t;N),Vj{t;N)} = CoT{Z,{t),Zj{t)} =rij{t) =Cij{t)/ai{t)aj{t). 
Then by the CLT for i.i.d. vectors (see, e.g., [2]), one has 

(26) {Vi{t;N),...,Vd{t;N))^{Xi{t),...,Xd{t)), N^oo, 

where the r.v.s. X('^)(t) = {Xi{t), . . . ,X(i{t)) have a joint normal distribution 
with 

E{Xi{t)} = 0, Var{Xi(t)} = l, 
Cov{X,{t),Xj{t)} = r,j{t) = Cij{t)/ai{t)aj{t). 
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One can now infer from (20), (25) and (26) that the following convergence 
in distribution holds: 

d 

(27) W,it; N) 4 Yiit) = ai(t)[l - Piit)]Xi{t) - pi{t) J] akit)Xk{t), 

observing the fact that U{t;N)/N — > M{t) a.s. in accordance with the LLN. 
From (27) and (19), it follows that for every i = 1,2, . . . ,d 

d 

(28) Y,{t) = Y,aki{t)Xk{t) 

k=l 

is a linear combination of multivariate normal r.v.s. so that Yi{t) is normally 
distributed (see, e.g., [8], Chapter 3). Then from (28), one has 

d d 

Var{y,(t)} = Y.Y.^{Xk{t)Xi{t)}aki{t)auit). 

k=l 1=1 

Using (19) one arrives at (22). Formula (24) follows directly from (26) 
and (27). On the other hand, one can use (28) to write 

(29) Y('=)(t) = x('^)(t)Arfxfc(t), A; = l,2,...,d. 

Formula (24) follows from (29) since the vector 'Y^^\t) is a linear trans- 
formation of the multivariate normal vector X^'^) (t) with covariance matrix 
RW(t) = ||r,,(t)|| . □ 

Corollary 1. By (19) and (22), it is not difficult to derive 
du{t)^Sf{t) 

(30) 

= {i-p,{t)faKt)+piit) E Cki{t)-2p,m-p,{t))Y,akit). 

k,lj^i k^i 

To prove this, one only has to check that 

(afit){l-p,it)f, k = l = i, 

rki{t)aki{t)aii{t) = i -p^(t){l - pi{t))Cik{t), k^i,l = i, 
y-p,{t){l-p,{t))Ca{t), k = i,l^i. 

While condition (9) implies that the fractions Ai{t; N),i = 1,2, ... ,d, are 
linearly dependent, there exist d — 1 joint normal distributions of lower di- 
mensions that are asymptotically nondegenerate. For example, consider the 
vector 

A(^-^\t; N) = (Ai(t; iV), A2(t; N),..., A,„i(t; N)). 
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This vector has a limiting (d — 1)- dimensional joint normal distribution 
with mean-vector K{A.^'^~^\t; N)} = {pi{t),p2{t), . . . ,Pfi_i{t)) and covari- 
ance matrix B^'^-'^^t) / N ]VP {t) , where the matrix T>^'^-^\t) = \\dij{t)\\ is 
defined by (24) with k = d — 1. 



Corollary 2. Ifi^j, then 
(31) dij (t) = dj (t) +p, {t)pj (t) J2 Cki (t) - Pi it) Ckj (t) - p, it) Cu it) . 

k,l k I 

This result follows immediately from (19) and (24). Indeed, from (24), one 
has 

d 

dij{t)= Y rki{t)aki{t)aij{t), 

k,l=l 

and using (19) one obtains 

(p^{t)pj{t)Ckl{t), k^i,l^j, 

y{i-Pi{t)){i-p,{t))Ci,{t), k=i,i=j. 

5. Statistical applications in relation to cell proliferation. As was pointed 
out in the Introduction and in Section 2, there are experimental situa- 
tions where analyzing the relative frequencies, Ai{t;N), of cell types rather 
than the total cell counts Zi(t;N),i = l,2,...,d may be quite advanta- 
geous. Should this be the case, the property of asymptotic normality could 
be useful in developing the needed statistical inference of model parame- 
ters from experimental data. In particular, the following observation pro- 
cess is directly relevant to quantitative studies of proliferation, differentia- 
tion and death of cells. Suppose that the process under study begins with 
N = J2k=i ^k cells of type Ti and the values of are all large, that is, 
A^'o = min{A''i, A''2, . . . , Nn} — > oo. The descendants of the first A''i ancestors 
are examined only once at time ti to determine empirical counterparts of 
Ai{ti;Ni),i= l,2,...,d, whereupon the observation process is discontin- 
ued (i.e., the cells under examination are destroyed). At the next moment 
^2 ^ ^1) the fractions Ai{t2;N2),i = 1,2, . . . ,d, related to the descendants of 
the second N2 ancestors are observed, and so on. This procedure results in 
n independent observations of the form: 

Cfc = A{tk]Nk) = {Ai{tk;Nk),A2{tk;Nk), Ad{tk;Nk)), 

ti<t2<-- - <tn, k = l,2,...,n. 
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with each vector being asymptotically normal in accordance with The- 
orem 1. Denoting the corresponding contribution to the asymptotic log- 
likelihood function by Lk{Ck'^tk, N^), the overall asymptotic log-likelihood is 
given by 

n 

(32) A„(Ci, C2, • • • , Cn) = E Lk{Ck;tk,Nk). 

k=l 

The asymptotic log- likelihood (32) depends solely on parameters of indi- 
vidual multitype processes arising from a single initiator cell of type Ti. It 
is the latter parameters that are of primary interest in cell kinetics studies; 
they can be estimated from the data on relative frequencies by maximizing 
the log- likelihood (32). It should be emphasized that the only rationale for 
resorting to the asymptotic likelihood is that the ordinary likelihood is not 
readily available for partially observed branching processes of such complex- 
ity. Two more specific examples are given below. 

Example 1. Let d = 2 and assume that Z{t;N) = {Zi{t; N) , Z2{t; N)) ,t = 
0,1,2,..., is a Bienayme-Galton- Watson (BGW) branching process. This 
model represents a powerful tool in the analysis of time-lapse data generated 
via video-recording of individual cell evolutions. It is well known that the 
BGW process is entirely determined by the offspring p.g.f. 

(33) /ii(si,S2)=E{sf^(')s^^(') |Z,(0) = 1}, i = l,2. 

The first and second moments of the offspring distribution are derived from 
(32) in the usual way, that is, 

d 

(34) = — /ii(si,S2)|si=s2=i> ^,j = l,2, 

(35) 6*fc = ^^-^/ii(si,S2)|si=s2=i, «,J,^ = 1,2. 

Using (36) and (37), let us derive the moments defined in (3)-(6) for every 

generation t = 1,2, First of all, note that M(t) = ||m.y(t)|| = M*, where 

mij{t)=¥.{Zj{t) I Zi(0) = 1} and m=\\mij\\. Let ^^^{t) =W.{Zj{t){Zk{t) - 
6jk) I Zi{0) = 1}, where 6jk = 1, j = k and 5jk = 0,j / k. Note also that 
mjil) = mj and = b]^. 

Using the recurrence formula, 

22 2 

(36) h]^{t+l) = Y,Y.^imij{t)mrk{t)+Y,milb'jk{t), t=l,2,..., 

1=1 r=l 1=1 
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it follows from Theorem 1 and formula (32) that 
(37) 



1 

A„ (Ci , C2 , . . . , Cn) = - ^ log 27r - - ^ log ^2 (ifc ; iV^) 

fc=i 



1 " 



i,Y.'^Ck-p{tk)?/S\tk;Nk), 

^ k=l 

where Cfe = A{tk; Nk),p{tk) = mu{tk)/M{t) and M{t) = [mii(tfc) +mi2(tfc)]. 
Taking (30) into account, one obtains 

5^(t.;iV.) = ^^^M(t.)[i-P(4)]^ 

where, by virtue of (5) and (6), one has to set crf{ti^) = bj-{tk) + mii{tk) — 
'^ii(tk),i = 1,2, and C^it) = ^JsW - mu{t)mi2{t). 

The log-likelihood A„ can now be constructed [proceeding from (37)] as 
a function of the observations (Cii C2) • • • i Cn) and the moments {niij} and 
{^jfc}' hjik= 1,2, to obtain asymptotic maximum-likelihood estimates of 
the model parameters of interest. 

Example 2. Recalling the model of oligodendrocyte generation in cell 
culture discussed in Section 2, consider a two- type reducible Bellman-Harris 
process {Zi{t;N),Z2{t;N)), t>0, with offspring p.g.f. 

(38) hi{si,S2)=Po+Pisl+p2S2, h{l,l) =po+pi+p2 = l. 

In this process, the life-span of every progenitor (type Ti) has cumulative 
distribution function Gi{t). At the end of its life (mitotic cycle), every pro- 
genitor cell either dies with probability po, or divides into two new Ti cells 
with probability pi, or differentiates into a new cell type T2 (oligodendro- 
cyte) with probability p2- Every cell of type T2 has its life-span distributed 
in accordance with G2{t) and, at the end of its life, it dies without giving 
rise to any progeny, that is, its offspring p.g.f. is /i2(si,S2) = 1. 

By conditioning on the evolution of the first progenitor cell and applying 
the law of the total probability, one can establish that the p.g.f.s 

Flit- si, S2) = E{sf I Zi(0) = 1}, Fi(0; 51,^2) = si, 

F2(t;s2)=E{s2'^*^ 1^2(0) = 1}, F2iO;s2) = S2, 

satisfy the following equations: 

Flit; si, S2) = r hiiFiit - u; si, S2), i^2(i - u; S2)) ddiu) + si(l - Gi(t)), 
Jo 

F2it;S2)=S2il-G2it)) + G2it). 
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The first and second moments of tlie process are readily derived from these 
equations. Then Theorem 1 can be applied to obtain maximum likelihood 
estimates of the parameters incorporated into the model. 

The above line of reasoning applies to the so-called clonal analysis, that is, 
the analysis of cell counts in mixed clones obtained at discrete moments of 
time. However, in time-lapse experiments (see, e.g., [14]), it is also possible 
to observe the fractions Aj(ifc) in the embedded discrete time branching 
structure determined by the sizes of consecutive generations. In general, 
the embedded discrete time process of a d-type Bellman-Harris branching 
process is a d-type BGW process with the same offspring distributions. In 
this case, formulas (33)-(35) and (38) yield 

mii = 2pi, mi2=P2, 77221 = 77122=0, 

b\i = 2pi, hjj^ = for all other indices j, k, 
al = ipi{l -pi), o"| = 0, Ci2 = -2pip2- 
Hence, one has 

mn{t) = {2piY, mu{t) = {2pif-'p2, bl,{t) = t{2p^Y, 
al{t) = {2pi)\t + l-2pi), ai(t) = 0, Cuit) = -p2{2pi)^'-^ 

for t= 1,2,.... 

Surprisingly, in this particular case 

(39) p(tk) =mu{tk)/[mii(tk) +mi2{tk)] = 2pi/{2pi +P2) 

is a constant for every t. 

Therefore, from (30) one obtains 

S\tk;Nk) = Sf{tk)/NkM\tk) 

(40) 

_ 1 / 2pi VL ,^ o / P2 
Nk\2pi+p2j I ^ \2pi+p2 

The required log-likelihood An{pi,P2) = A.„(Ci, C21 • • • > C I PI1P2) follows 
immediately from (39), (40) and (37). 

Remark. Quine [21] investigated the moment structure of the multitype 
Galton- Watson process and derived useful linear recurrence relations. 

6. Discussion and concluding remarks. Since the paper by Jagers [15], 
the concept of relative frequencies (cell fractions) has attracted little atten- 
tion of investigators in the field of branching stochastic processes. This is 
unfortunate in view of the need for pertinent methods of stochastic model- 
ing and statistical analysis of the fractions of cells rather than their counts 
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in the area of cell biology. Published in 1971, a subsequent work by Mode 
[20] is evidence in favor of this opinion. Mode built his cell cycle analysis 
on a model of multitype positively regular age-dependent branching pro- 
cess. In the supercritical case, he proved that lim Afc(t) = 5k a.s. as i — > oo, 
providing the population does not become extinct. It should be noted that 
the constants 6k,k = 1,2, ... ,d, depend only on the offspring characteristics. 
In fact, 6k = Vk/J2j=ii'j, wheice Vk = r]k{l - Gl{a)), rj = {r]i,r]2, ■ ■ ■ ,r]d) is 
a left eigenvector of the matrix H(a) = ||G^(a)mjj|| with the Malthusian 
parameter a, while G^(A) is the Laplace-Stieltjes transform of the life-span 
distribution Gk{t) for the type Tk,k = 1,2, ... ,d. In his monograph. Mode 
[19] also considered the utility of fractions and reported a similar result for 
the BGW process. 

Methods of statistical inference for branching processes with an increasing 
number of ancestors were developed by Yanev [26] (see also [9]), Dion and 
Yanev [4, 5, 6] and reviewed later by Yanev [28]. A diffusion approximation 
for the classical BGW process with a large number of ancestors in the near- 
critical case was introduced by Feller [7] and developed further by Lamperti 
[18] and others. Some of these results were summarized and discussed by 
Jagers [16]. The work of Lamperti [18] also reports some interesting limiting 
distributions. 

However, the main focus has always been on the numbers of individuals 
(cells) of different types and not on their relative frequencies. In the present 
paper, we make another step in the same direction by considering the asymp- 
totic behavior of the fractions Afc(t) as the initial number of ancestors tends 
to infinity but the time t is fixed. The convergence results established for 
Afc(t) may have far-reaching statistical implications. 

The results obtained by Jagers and Mode suggest that it would be inter- 
esting to investigate the asymptotic behavior of the fractions Ak{t; N) when 
both parameters and t tend to infinity simultaneously. It is anticipated 
that such asymptotic properties will depend on a specific branching model 
and its reducibility. They are also expected to be different for supercritical, 
critical, and subcritical processes. 

Yet another open problem has to do with correlations between times to 
division for sister cells. As conjectured by Harris [11], the mean number of 
cells is not affected by this type of correlation while the variance can only be 
larger than that in the independent case. Crump and Mode [3] were the first 
to systematically study an age-dependent branching model under which the 
life-spans of sister cells are correlated as well as the numbers of offspring 
of sister cells, but otherwise the cells live and reproduce independently. A 
bifurcating autoregressive branching process [10] represents another relevant 
example. Asymptotic properties of such processes as N or/and t tend to 
infinity have yet to be explored. 
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